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BACKGROUND OF THE INVENTION 



1. Related Application 

This application claims the benefit of U.S. Provisional Patent Application Serial 
No. 60/257,499, entitled "Methods and Systems for Backing Up and Restoring Computer 
Data," filed December 21, 2000, which is incorporated herein by reference. 

2. The Field of the Invention 

The present invention relates to backing up and restoring computer data. More 
specifically, the present invention relates to systems and methods for minimizing the loss 
of computer data when restoring computer data that was lost due to the interruption. 

3. The Prior State of the Art 

With the advent of personal computers, businesses worldwide rely on computer 
data in performing daily business routines. However, a variety of events such as natural 
disasters, terrorism, or more mundane events such as computer hardware and/or software 
failures can occur while a computer is processing. These failures often result in causing 
the computer data to become corrupt, unreliable, or even lost. The corruption or loss of 
data, such as customer lists, financial transactions, business documents, business 
transactions, and so forth, can cause havoc to businesses by resulting in the loss of large 
investments of time and/or money. 

The loss or corruption of computer data is particularly devastating in the world of 
electronic commerce. The Internet has allowed individuals all over the world to conduct 
business electronically, thereby resulting in the continual upload of electronic orders. 
However, all copies of the orders are electronic and thus corruption or loss of the 
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• # 

electronic computer data can result in the loss of the business represented by the lost 
orders. 

Recognizing the commercial value of reliable computer data, businesses seek 
ways to protect their data and to reconstruct data that has become corrupt, unreliable, or 
lost. Traditional approaches of data protection and reconstruction have involved creating 
a backup copy of the computer data. While it is a simple procedure to preserve a backup 
copy of an individual file on a floppy disk, the creation of a backup copy becomes more 
difficult as the amount of data increases. 

Perhaps one of the simplest approaches to creating a backup copy of a large 
volume of computer data is to copy the data from a mass storage system to an archival 
device, such as one or more magnetic tapes. This method stores large amounts of 
computer data at the expense of immediate access to the data. The magnetic tapes are 
stored either locally or remotely, and the data is copied from the magnetic tapes to the 
mass storage system when problems arise with the mass storage system. 

While the use of an archival device to preserve data loss has the advantage of 
being relatively simple and inexpensive, it also has severe limitations. One such 
limitation is the amount of time that prevents user accessibility to the computer data 
while a backup copy is created and while data is reconstructed. The prevention of user 
accessibility has traditionally been required to ensure that no data has changed during the 
process. Because user inaccessibility of data is undesirable, backup copies are created 
less frequently, thereby causing the computer data in the backup copy to become stale. 
Similarly, transferring computer data from a magnetic tape to a mass storage system can 
become very lengthy because the computer data is transferred on a file-by-file basis. The 
time is further lengthened when the archival mass storage device is remotely located and 
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is not accessible over a network. These long reconstruction periods result in extended 
computer inaccessibility and cost businesses increased amounts of time and money. 

Another limitation of the traditional methods is that the backup copy represents 
data as it existed at a previous instant in time, meaning that the backup copy is not current 
with subsequent changes made to the original copy. The creation of a backup copy 
provides the security that a large portion of the computer data can be recovered in the 
event that the original copy becomes corrupt or lost. This limits the loss to include only 
the changes made to the original copy since the creation of the last backup copy. 
However, in some businesses, if the computer data is not current, the data is stale, 
unreliable, and even useless. This is particularly troubling in the financial world where 
rates and information change with great frequency. Thus, when the computer data 
becomes corrupt or lost, the businesses that rely on information that changes frequently 
are exposed to the risk of losing all of their valuable computer data. 

It would, therefore, represent an advancement in the art to have a system and 
method for backing up computer data that could further ^minimize the amount of 
computer data that is lost due to a computer failure or corruption of data) It would also 
represent an advancement in the art to have a^s^stem that allowed data to be backed up 
without terminating user access to the mass storage system. 
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SUMMARY OF THE INVENTION 



The present invention relates to systems and methods for backing up and restoring 
computer data. The invention enables computer data to be restored in an incremental 
manner in the event that data is corrupted or lost. In particular, if data is lost or 
corrupted, the data can be incrementally advanced through successively older states until 
a valid set of data is obtained. In this manner, data can be restored to a state that is newer 
than that associated with a full mirrored or archived copy of the data. ^Thus, full mirror or 
archiving operations on a volume of data can be less frequent without the risk of losing 
changes to the volume of data that have occurred since the last full mirror or archiving 
operation.) 



According to the invention, a mass storage device stores a plurality of data blocks 
at time T 0 . At T 0 , a mirrored copy of the data stored of the mass storage device may be 
made and stored such that the data at T Q can be conveniently restored, if necessary. 
Obtaining a mirrored copy of data is often a time or resource consuming process that is 
preferably conducted relatively infrequently. In order to preserve the data that is changed 
after the mirrored copy of data is created and before a potential event causing loss of 
data, data blocks that are to be overwritten after T 0 are stored in a preservation memory. 
In particular, if, after T 0 , a specified data block is to be overwritten as part of a write 
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ock is stored in a preservation memory prior to 
i the mass storage device. (In addition, the data 



/ is time^stampsd or otherwise marked so as to 




i or to designate a chronological position of the 




s operations. ) 
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As write operations are successively performed after time T 0 , the original data 
blocks that are to be overwritten are sequentially stored in the preservation memory with 
an associated time stamp. Thus, the data blocks that are overwritten or otherwise 
changed after T 0 are preserved in a preservation memory and the time or the order in 
which the data blocks were overwritten in the mass storage device is specified. 

In the event that certain data blocks in the mass storage unit device are lost or 
become corrupted, the data blocks stored in the preservation memory can be used to 
incrementally restore or reconstruct a valid set of data without reverting completely back 
to the data as it exists at time T 0 . If, for example, invalid or corrupted data is written to 
certain data blocks in the mass storage device after time T 0 , the original, valid data blocks 
are stored in a preservation memory as described above. Using the time stamps 
specifying the chronological sequence in which the data blocks stored in the preservation 
memory were overwritten in the mass storage device, the data blocks in the preservation 
memory are written to the current data stored in the mass storage device. 

After one or more data blocks from the preservation memory are written back to 
the mass storage device data in reverse chronological order, a valid set of data is 
eventually obtained at the mass storage device. Thus, the data blocks stored in the 
preservation memory are used to reconstruct the data without requiring the data to be 
reverted completely back to the data as it existed at time T 0 . 

It is noted that, according to the invention, complete mirror or archive operations 
performed on the full volume of data stored in the mass storage device can be less 
frequent than would be otherwise required in the absence of the invention. The data 
blocks stored on preservation memory are used to restored corrupted data to a state more 
recent than that associated with the most recent full mirrored copy of the data. It is also 
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noted that storing a sequence of overwritten data blocks in the preservation memory is 
typically less resource intensive than performing a full mirror operation on the data of the 
mass storage device. In addition, the data stored in the preservation memory in response 
to the write request can be a data block having fine granularity as opposed to being an 
entire file. For instance, a data block stored in a preservation memory response to a write 
operation in the mass storage device can be data associated with a single sector or another 
portion or segment of the mass storage device. Storing data in a preservation memory in 
this manner avoids the need of copying entire files in response to write operations and 
permits the data to be preserved independent of any file structure associated with the 
data. 

Additional features and advantages of the invention will be set forth in the 
description that follows, and in part will be obvious from the description, or may be 
learned by the practice of the invention. The features and advantages of the invention 
may be realized and obtained by means of the instruments and combinations particularly 
pointed out in the appended claims. These and other features of the present invention 
will become more fully apparent from the following description and appended claims, or 
may be learned by the practice of the invention as set forth hereinafter. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
In order that the manner in which the above recited and other advantages and 
features of the invention are obtained, a more particular description of the invention 
briefly described above will be rendered by reference to specific embodiments thereof 
that are illustrated in the appended drawings. Understanding that these drawings depict 
only typical embodiments of the invention and are not therefore to be considered to be 
limiting of its scope, the invention will be described and explained with additional 
specificity and detail through the use of the accompanying drawings in which: 

Figure 1 is a block diagram representing the computer system having an 
associated mass storage device and preservation memory in which the invention can be 
implemented. 

Figure 2 is a block diagram of the computer system of Figure 1, showing data 
stored in the mass storage device at time T 0 . 

Figure 3 illustrates a sequence of write operations in which data blocks that are to 
be overwritten are stored in a preservation memory. 

Figure 4 illustrates an operation in which data in a mass storage device is 
incrementally restored in response to a data corruption event. 

Figure 5 illustrates the operation of a virtual device that appears as if it contains a 
copy of data as it existed previously in a mass storage device. 

Figure 6 illustrates a computer system representing an example of a suitable 
operating environment for the invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention relates to backing up and restoring computer data. In 
particular, the invention enables corrupted data to be restored without requiring the data 
to be reverted completely back to a most recent full-mirrored copy of the data. Data 
blocks that are to be overwritten after a point in time in which a mirrored copy of the data 
has been created are stored in a preservation memory. The data blocks are associated 
with a time stamp other information that designates the time or the chronological order in 
which the data blocks were overwritten or the original data blocks in the mass storage 
device were overwritten. In general, the term "time stamp" refers to any such 
information designating the chronological order or the time of the data blocks stored in 
the preservation memory. The data blocks in the preservation memory can be used in the 
event of data corruption to incrementally roll the current, corrupted copy of the data in 
the mass storage device to a valid set of data. 

Figure 1 illustrates a computer system in which the invention can be practiced. 
The computer system of Figure 1 includes a computer 10, a mass storage device 12, a 
preservation memory 14, and an I/O device 16. Computer 10 can be any computer or 
other processing device that manages, generates, stores, or otherwise processes data. For 
instance, computer 10 can be a conventional personal computer, a server, a special 
purpose computer, or the like. Figure 6, discussed in greater detail below, sets forth 
details of one example of a computer in which the invention can be practiced. 

Mass storage device 12 is associated with computer 10 and is used to store data 
obtained from computer 10. In general, mass storage device 12 is^ajvritable, nonvolatile 
m^s_stomge_device. In particular, mass storage device 12 can be the hard drive 
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associated with a conventional personal computer or any other storage volume that is 
used to store data obtained from computer 10. 

Preservation memory 14 is a physical or logical device associated with computer 
10 in which data blocks that are to be overwritten in mass storage device 12 are stored. 
Preservation memory 14 can be a volatile device, such as a random access memory 
(RAM) or any other device that can store data blocks that are to overwritten in mass 
storage device 12. Although preservation memory 14 is illustrated as being a separate 
device in Figure 1, the preservation memory can be a parti tion or another portion of mass 
storage device 12. 

As shown in Figure 1, computer system 10 may include an input/output (I/O) 
device 16 that allows a user to manipulate data processed by computer 10 and stored in 
mass storage device 12. In general, computer 10 can have I/O device 16 or any other 
peripheral device that enables users to manipulate data. In addition, computer 10 
typically includes applications or software having data storage requirements satisfied by 
the mass storage device 12. In general, however, the invention is broad enough to extend 
to substantially any computer 10 and associated hardware and software, so long as there 
is a mass storage device 12 for storing data and preservation memory 14 for storing 
overwritten data blocks. 

Figure 2 illustrates the computer system of Figure 1 at a time T 0 , when mass 
storage device 12 includes a set of data blocks storing given data. In the example of 
Figure 2, for purposes of illustration, mass storage device 12 is shown as including five 
data blocks, namely, data blocks A, B, C, D, and E. At T 0 , it is assumed that a mirrored 
or other backup copy of the full volume of data stored in the mass storage device is 
created and stored. The mirrored or backup copy of the data obtained at To can be 
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generated in any desired manner, including conventional ways. In one embodiment, the 
mirrored copy or backup of the data at To can be obtained using the techniques disclosed 
in U.S. Patent No. 5,649,152, entitled "METHOD AND SYSTEM FOR PROVIDING A 
STATIC SNAPSHOT OF DATA STORED ON A MASS STORAGE SYSTEM" or U.S. 
Patent No. 5,835,953, entitled "BACKUP SYSTEM THAT TAKES A SNAPSHOT OF 
THE LOCATIONS IN A MASS STORAGE DEVICE THAT HAS BEEN IDENTIFIED 
FOR UPDATING PRIOR TO UPDATING", which are incorporated herein by reference. 
In general, U.S. Patent No. 5,649,152 discloses a technique for taking a snapshot of data 
as it exists at a selected moment by preserving data overwritten by write operations. U. 
S. Patent No. 5,835,953 generally describes techniques for obtaining snapshots or 
mirrored copies of data by transferring minimal amount of data to a remote location. 
Alternatively, the mirrored or backup copy of the data of the mass storage device 14 at To 
can be obtained using conventional data transfer processes. 

Depending on the nature of the data in mass storage device 12, it may be useful to 
obtain a mirrored or backup copy of the data at To that includes logically consistent data. 
This is particularly important when the data stored in mass storage device 12 represents 
transactions, each requiring a sequence of write operations or otherwise requiring a set of 
I/O operations to be performed in order to have a valid and useful set of data. Data is 
logically inconsistent when fewer than all of the necessary operations in a transaction or 
in another required sequence of I/O operations have been performed. Thus, depending on 
the nature of the data stored in mass storage device 12, the mirrored or backup copy of 
the data at To may need to be logically consistent data. 

Obtaining a mirrored or backup copy of the data at T 0 enables the data blocks A, 
B, C, D, and E stored at mass storage device 12 at To to be reconstructed simply by 
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accessing the mirrored or backup copy of the data. As stated above, however, full mirror 
or backup operations are often time or bandwidth intensive, such that relatively 
infrequent complete mirror or backup operations are desirable. As discussed above, and 
as described in more detail below, the present invention uses preservation memory 14 to 
store data blocks that enable data of mass storage device 12 to be incrementally restored 
in reverse chronological order without requiring the data to be reverted completely back 
to time To. Although periodic mirrored or backup copies of the data of the mass storage 
device 12 are often useful, the invention can also be practiced solely on the basis of data 
blocks being stored at preservation memory 14 without periodic mirrored or backup 
copies. 

In order to describe the manner in which the data is preserved in the preservation 
memory after time To, Figure 3 presents a specific example of a set of data blocks in the 
mass storage device and a sequence of the write operations applied thereto. As shown in 
Figure 3, the data blocks 20 stored in mass storage device at T G are designated as (A, B, 
C, D, E). At Ti.o, the computer 10 in Figure 2 issues a write request, whereby data block 
30 (Ai) is to overwrite existing the data block A, resulting in a set of data blocks 20a (Ai, 
B, C, D, E). In general, in response to a issued write request and prior to the data block 
in a mass storage device being overwritten with the new data block, the existing data 
block is stored in the preservation memory, along with a time stamp. 

In this example, at Ti.o, data block A is written to the preservation memory, along 
with a time stamp designating the time Ti.o as shown at 40a. Thus, even though data 
block A has been overwritten in a mass storage device, the data block A is stored in the 
preservation memory in the event that this data block is needed to reconstruct corrupted 
data at some future point. 
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At time Tu, data block 30b (Dj) is written to the data stored in the mass storage 
device, resulting in a set of data blocks 20b (A l? B, C, Dj, E). Prior to the write 
operation, the data block D, which is to be overwritten, is stored in the preservation 
memory, along with a time stamp designating the time Tu, as shown at 40b. 

At time T1.2, a data corruption event occurs, resulting in data block 30c (D x ) being 
written to the data stored in the mass storage device. As used in this example, the 
subscripts "x" and "y" represent corrupted data. The data corruption can have 
substantially any cause, such as data entry problems, software problems, hardware 
problems, or the like. The data corruption event at time T1.2 results in the set of data 
blocks 20c (Ai, B, C, D x , E). Prior to the write operation, the data block Di, which is to 
be overwritten, is stored in the preservation memory, along with a time stamp designating 
the time Ti. 2 , as shown at 40c. 

At time Tu, data block 30d (Bi) is written to the data stored in the mass storage 
device, resulting in a set of data blocks 20d (Ai, Bi, C, D x , E). Prior to the write 
operation, the data block B, which is to be overwritten, is stored in the preservation 
memory, along with a time stamp designating the time T 1.3, as shown at 40d. 

Finally, at time T1.4, another data corruption event occurs, resulting in data block 
30e (D y ) being written to the data stored in the mass storage device. The data corruption 
event at time T1.4 results in the set of data blocks 20e (A u B\, C, D y , E). Prior to the write 
operation, the data block D x , which is to be overwritten, is stored in the preservation 
memory, along with a time stamp designating the time T1.4, as shown at 40e. 

Thus, at time T1.4, the data blocks 20e existing at the mass storage device are (Aj, 
Bj, C, D y , E). As noted above, the data block D y represents corrupted data, which 
potentially makes all of the data stored in mass storage device unusable. At T1.4, 
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preservation memory also has stored therein data blocks (A, D, Di, B, D x ) and the 
corresponding time stamps (Ti.o, TYi, Ti.2, Tu, T1.4). 

Because the data corruption event has occurred since To, a valid set of data can be 
obtain by reverting completely to the data as it existed in the mass storage device at To by 
accessing any mirrored or backup copy of the full volume data that was created at To. 
However, reverting completely to the data as it existed at T 0 would result in a loss of all 
subsequent data written to the mass storage device after T 0 . Thus, according to the 
invention, in response to the data corruption event, the data blocks stored in the 
preservation memory are used to incrementally restore the data in the mass storage device 
in reverse chronological order to at a point at which valid, non-corrupted data exists as 
shown in Figure 4. The preservation memory in Figure 4 at time T 2 includes the set of 
data blocks and corresponding time stamps that existed in the preservation memory at 
time T1.4. Likewise, the mass storage device in Figure 4 at time T 2 includes a set of data 
blocks 20e (Ai, Bi, C, D y , E) that existed at T1.4 in Figure 3. 

The data restoration operation illustrated in Figure 4 begins with the current set of 
potentially corrupted data blocks 20e and the data blocks 40e stored in the preservation 
memory. The time stamps T1.0 - T1.4 are used to reconstruct the data that previously 
existed in the mass storage device in reverse chronological order. Thus, the most recently 
stored data block in preservation memory (D x ) is written to the set of data as it existed at 
time T 2 to roll back the set of data in the mass storage device to the state in which it 
existed at T1.3, resulting in a set of data (Ai, Bi, C, D x , E). 

It is noted that this operation of rolling back the data of the mass storage device as 
illustrated in Figure 4 can be performed using the actual data stored in the mass storage 
device or copy of thereof. 



- Page 14 - 



Docket No. 14113.26.1 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 




Upon rolling back to T1.3, it is determined, either by the computer or manually, 
that the set of data 20d of T1.3 still represents corrupted data, in that data block D x is 
corrupted. Because the data remains corrupted, the data of the mass storage device is 
further rolled back to time TY 2 using data block B of the preservation memory, which has 
the time stamp T1.3. Thus, the resulting data representing the state of the mass storage 
device at T1.2 is (Ai, B, C, D x , E) 20c. Because data blocks 20c also include corrupted 
data block D x , it is determined that the set of data blocks 20c represent corrupted data. 
Accordingly, the data is further rolled back to the state at which it existed at the mass 
storage device at Tu by writing to the data at the mass storage device the data block of 
the preservation memory that is next in reverse chronological order. In particular, data 
block Di having time stamp T1.2 is written to the data blocks of mass storage device, 
resulting in a set of data blocks 20b (Ai, B, C, Di, E), which represents the data as at 
existed at the mass storage device at T 1.1. 

At this point, it is determined that the data 20b (Ai, B, C, Di, E) represents valid, 
non-corrupted data. Thus, the data blocks of the preservation memory have been used to 
incrementally restore in reverse chronological order the data blocks of the mass storage 
device until such time that a valid set of data is obtained. It is also noted that the data 
blocks 20b (Ai, B, C, Dj, E) includes certain data (i.e. 2 , Aj, and Di) that would have not 
been included in the restored data had the data been reverted completely back to the 
mirrored or backup copy data of T 0 . Moreover, this more recent data is restored without 
requiring a sequence of full mirror or backup operations after To. 

In view of the foregoing, the operation for restoring the data generally involves 
applying the data blocks the preservation memory in reverse chronological order to a 
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current copy of the data blocks of the mass storage device until such time that the data 
blocks represent valid non-corrupted data. 

As can be understood, when the number of write operations is large or frequent, 
the number of data blocks stored in the preservation memory can increase rapidly. Thus, 
in practice, there is a trade-off between the frequency of full mirror or backup operations 
and the volume of data blocks that are stored in the preservation memory. In one 
embodiment, the frequency at which the full mirror or backup operations are performed is 
determined by the frequency at which the preservation memory is filled or reaches a 
certain size. Other words, as the volume of data stored in the preservation memory 
approaches the capacity of preservation memory, a full mirror or backup operation is 
performed on the mass storage device. This enables the data in a preservation memoiy to 
be discarded, since it is no longer needed in view of the fact that a more recent mirrored 
or backup copy of the data of the mass storage device has been created. 

While the invention has been described herein in reference to incrementally 
restoring corrupted data in reverse chronological order in response to a data corruption 
event, there are other uses for the basic methods of invention. For instance, the data 
blocks of the preservation memory can be used to roll the data in the mass storage device 
back to previous state for other reasons. Indeed, in substantially any situation in which a 
user wishes to obtain data as it existed previously in a mass storage device. 

In yet another embodiment, data stored in the preservation memory is combined 
with a mirrored or backup copy of data as it existed at a selected point in time to roll the 
data of the mass storage device back to a time prior to the creation of the mirrored or 
backup copy. As described above in reference to Figure 4, data blocks stored in a 
preservation memory prior to being overwritten in the mass storage device can be used to 
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incrementally roll back the data in the mass storage device. Thus, if the preservation 
memory includes a sequence of data blocks that were overwritten in the mass storage 
device prior to a subsequent mirrored or backup copy of the data of the mass storage 
device, the sequence of data blocks in the preservation memory can be used to 
incrementally advance the mirrored or backup copy back in time as desired. 

With overwritten data blocks stored in the preservation memory, the invention 
can be used to establish a virtual mass storage device ("virtual device") that permits data 
that existed previously in the mass storage device to be accessed. The virtual device 50 
of Figure 5 appears, from the standpoint of the user or an application (i.e., data access 
program 60) that accesses the virtual device, to contain the data that existed at a previous 
point in time. For example, the virtual device 50 can be accessed using an operating 
system and an associated file system as if the virtual device were another hard drive 
installed on computer 10 of Figure 1. 

One example of the manner in which the virtual device 50 can be used to access 
data as it existed at a previous point in time is illustrated in Figure 5, using the same set 
of data that has been previously described above in reference to Figures 3 and 4. In the 
example of Figure 5, it has been determined that a valid set of data can be obtained by 
rolling the data blocks 20e stored in the mass storage device 12 to the state in which they 
existed at Tu using the data blocks 40e stored in preservation memory 14, as has been 
described in reference to Figure 4. 

When virtual device 50 is used to access data, data access program 60 issues read 
requests 70 to virtual device 50 rather than addressing the requests specifically to mass 
storage device 12 or preservation memory 14. In this example, the read requests are used 
to access the most recent valid set of data that existed prior to the data corruption event 
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that introduced corrupted data blocks D x and D y to the set of data blocks in mass storage 
device 12. As described previously, it can be determined that a set of valid data that 
existed just prior to the introduction of corrupted data block D x can be obtained. In other 
words, the valid set of data at Tu, which is just prior to the data corruption event of Ti. 2 , 
can be obtained. 

Upon receiving a request for data, as it existed in the previous, non-corrupted 
state, the virtual device determines whether the data satisfying the request is to be 
obtained from mass storage device 12 or preservation memory 14. If the read request is 
directed to a data block having a non-corrupted version that has been stored in 
preservation memory 14 at or after the data corruption event (i.e., at or after T1.2), the 
oldest such data block is accessed to respond to the read request. For example, if data 
access program 60 requests a data block at the "D" position that is associated with the 
most recent set of valid data, the read request is processed by accessing the oldest non- 
corrupted data block stored in the preservation memory at or after T1.2. Thus, in response 
to the request for the data block "D", the virtual device accesses data block Di, shown in 
Crosshatch in Figure 5. In a similar manner, a request for the "B" data block is processed 
by accessing data block B stored in preservation memory 14, which is also shown in 
Crosshatch in Figure 5. 

If, however, the read request is directed to a data block that does not have a non- 
corrupted version stored in preservation memory 14 at or after the data corruption event 
(i.e., at or after T1.2), the corresponding data block is accessed from mass storage device 
12 to respond to the read request. For instance, requests for data blocks at positions "A", 
"C", and "E" are processed by accessing the corresponding data blocks Ai, C, and E from 
mass storage device 12, which are shown in Crosshatch in Figure 5. In this way, data 
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access program 60 can request the full set of data blocks that existed at the previous, non- 
corrupted state from virtual device 50 and receive, in response thereto, the set of data 
blocks (Ai, B, C ,Di, and E). Virtual device 50 identifies which data blocks to obtain and 
whether to obtain the data blocks from mass storage device 12 or preservation memory 
14. Using the data blocks, data access program 60 does not need to know the details of 
mass storage device 12, preservation memory 14, or the data blocks stored thereon, but 
instead simply issues a request to virtual device 50 as if virtual device 50 contained the 
prior set of data. 

In one embodiment, the prior set of data can be reconstructed in the manner set 
forth above in reference to Figure 5. Specifically, the prior set of non-corrupted data can 
be obtained by establishing a virtual device and reading the prior set of non-corrupted 
data as set forth above. Data access program 60 then makes the prior set of non- 
corrupted data available as needed for substantially any use. 

The embodiments of the present invention may comprise a special-purpose or 
general-purpose computer that includes various components, as discussed in greater 
detail below. Embodiments within the scope of the present invention may also include 
computer-readable media for carrying or having computer-executable instructions or data 
structures stored thereon. Such computer-readable media can be any available media that 
can be accessed by a general-purpose or special-purpose computer. By way of example, 
and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, 
CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage 
devices, or any other medium which can be used to carry or store desired program code 
means in the form of computer-executable instructions or data structures and which can 
be accessed by a general-purpose or special-purpose computer. 
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When information is transferred or provided over a network or another 
communications connection (either hardwired, wireless, or a combination of hardwired or 
wireless) to a computer, the computer properly views the connection as a computer- 
readable medium. Thus, any such connection is properly termed a computer-readable 
medium. Combinations of the above should also be included within the scope of 
computer-readable media. Computer-executable instructions comprise, for example, 
instructions and data which may cause a general-purpose computer, special-purpose 
computer, or special-purpose processing device to perform a certain function or group of 
functions. 

Figure 6 and the following discussion are intended to provide a brief, general 
description of a suitable computing environment in which the invention may be 
implemented. Although not required, the invention can be described in the general 
context of computer-executable instructions, such as program modules, being executed 
by computers in network environments. Generally, program modules include routines, 
programs, objects, components, data structures, etc. that perform particular tasks or 
implement particular abstract data types. Computer-executable instructions, associated 
data structures, and program modules represent examples of the program code means for 
executing steps of the methods disclosed herein. The particular sequence of such 
executable instructions or associated data structures represents examples of 
corresponding acts for implementing the functions described in such steps. 

Those skilled in the art will appreciate that the invention may be practiced in 
network computing environments with many types of computer system configurations, 
including personal computers, hand-held devices, mobile telephones, personal digital 
assistants ("PDAs"), multi-processor systems, microprocessor-based or programmable 
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consumer electronics, network PCs, minicomputers, mainframe computers, and the like, 
one example having been presented in Figure 1 . The invention may also be practiced in 
distributed computing environments where local and remote processing devices are 
linked (either by hardwired links, wireless links, or by a combination of hardwired or 
wireless links) through a communications network and both the local and remote 
processing devices perform tasks. 

With reference to Figure 6, an example system for implementing the invention 
includes a general-purpose computing device in the form of a computer 120, including a 
processing unit 121, a system memory 122, and a system bus 123 that couples various 
system components including system memory 122 to processing unit 121. Computer 120 
and the associated component illustrated in Figure 6 represent a more detailed illustration 
of computer 10 of Figure 1. System bus 123 may be any of several types of bus 
structures including a memory bus or memory controller, a peripheral bus, and a local bus 
using any of a variety of bus architectures. System memory may include read only 
memory ("ROM") 124 and random access memory ("RAM") 125. A basic input/output 
system ("BIOS") 126, containing the basic routines that help transfer information 
between elements within the computer 120, such as during start-up, may be stored in 
ROM 124. 

Computer 120 may also include a magnetic hard disk drive 127 for reading from 
and writing to a magnetic hard disk 139, a magnetic disk drive 128 for reading from or 
writing to a removable magnetic disk 129, and an optical disk drive 130 for reading from 
or writing to removable optical disk 131 such as a CD-ROM or other optical media. 
Magnetic hard disk drive 127, magnetic disk drive 128, and optical disk drive 130 are 
connected to system bus 123 by a hard disk drive interface 132, a magnetic disk drive- 
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interface 133, and an optical drive interface 134, respectively. The drives and their 
associated computer-readable media provide nonvolatile storage of computer-executable 
instructions, data structures, program modules and other data for computer 120. 
Although the example environment described herein employs a magnetic hard disk 139, a 
removable magnetic disk 129 and a removable optical disk 131, other types of computer 
readable media for storing data can be used, including magnetic cassettes, flash memory 
cards, digital versatile disks, Bernoulli cartridges, RAMs, ROMs, and the like. 

Program code means comprising one or more program modules may be stored on 
hard disk 139, magnetic disk 129, optical disk 131, ROM 124, or RAM 125, including an 
operating system 135, one or more application programs 136, other program modules 
137, and program data 138. A user may enter commands and information into computer 
120 through keyboard 140, pointing device 142, or other input devices (not shown), such 
as a microphone, joy stick, game pad, satellite dish, scanner, or the like. These and other 
input devices are often connected to processing unit 121 through a serial port interface 
146 coupled to system bus 123. Alternatively, the input devices may be connected by 
other interfaces, such as a parallel port, a game port or a universal serial bus ("USB"). A 
monitor 147 or another display device is also connected to system bus 123 via an 
interface, such as video adapter 148. In addition to the monitor, personal computers 
typically include other peripheral output devices (not shown), such as speakers and 
printers. 

Computer 120 may operate in a networked environment using logical connections 
to one or more remote computers, such as remote computers 149a and 149b. Remote 
computers 149a and 149b may each be another personal computer, a server, a router, a 
network PC, a peer device or other common network node. Remote computers 149a and 
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149b may include many or all of the elements described above relative to the computer 
120, although only memory storage devices 150a and 150b and their associated 
application programs 136a and 136b are illustrated in Figure 6. The logical connections 
depicted in Figure 6 include a local area network (LAN) 151 and a wide area network 
(WAN) 152 that are presented here by way of example and not limitation. Such 
networking environments are commonplace in office-wide or enterprise-wide computer 
networks, intranets and the Internet. 

When used in a LAN networking environment, computer 120 is connected to 
local network 151 through a network interface adapter 153 or similar adapter. When 
used in a WAN networking environment, computer 120 may include a modem 154, a 
wireless link, or other means for establishing communications over the wide area network 
152, such as the Internet. Modem 154, which may be internal or external, is connected to 
system bus 123 via the serial port interface 146. In a networked environment, program 
modules depicted relative to computer 120, or portions thereof, may be stored in the 
remote memory storage device. It will be appreciated that the network connections 
shown are exemplary and other means of establishing communications over wide area 
network 152 may be used. 

The present invention may also be embodied in other specific forms without 
departing from its spirit or essential characteristics. The described embodiments are to be 
considered in all respects only as illustrative and not restrictive. The scope of the 
invention is, therefore, indicated by the appended claims rather than by the foregoing 
description. All changes, which come within the meaning and range of equivalency of 
the claims, are to be embraced within their scope. 

What is claimed and desired to be secured by United States Letters Patent is: 
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